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[57] ABSTRACT 

A neural network based improving the performance of an 
omni-font classifier by using recognized characters for addi- 
tional training is presented. The invention applies the out- 
puts of the hidden layer nodes of the neural net as the feature 
vector. Characters that are recognized with high confidence 
are used to dynamically train a secondary classifier. After the 
secondary classifier is trained, it is combined with the 
original main classifier. The invention can re-adjust the 
partition or boundary of feature space, based on on-line 
learning, by utilizing the secondary classifier data to form an 
alternative partition location. The new partition can be 
referred to when a character conflict exists during character 
recognition. 

3 Claims, 2 Drawing Sheets 
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ON-LINE LEARNING FOR NEURAL NET- the weights for successive layers amoQg the weights stored 

BASED CHARACTER RECOGNITION in the processors of the hidden layer can be made by the 

SYSTEMS processors independently from each other. 

TTvn^xrrrrw US - Pat - No * 5 > 195 » 169 > granted to Kamiya et al., 

FIELD OF THE INVENTION 5 describes a ^^0! device for controlling the learning of a 

The invention is generally related to neural networks and, neural networks wherein weight values of synapse connec- 

more particularly, to an on-line unsupervised neural net- tions between units of the neural network are monitored 
based learning system which improves the performance of during learning so as to update the weight values, 

□eural network character recognition systems by using rec- U.S. Pat. No. 5,214,747, granted to Cok, describes a 

ognized characters for additional training. 10 neural network wherein a memory array stores the weights 

nAr^onriMr. nc the r^xmnM ap P Hcd to cach io P llt via a multiplier, and the multiphed 

BACKGROUND OF THE INVENTION mput5 m accumulatcd m6 applicd to a lookup table ^ 

Artificial neural networks (or simply "Neural Networks") performs threshold comparison operations, 

and their applications are well known in the art "Neural" Neural networks have been joined to standard large opti- 

comes from neurons, or brain cells. Neurons are built by 15 cal character recognition systems to improve their recogni- 

imitating in software or silicon the structure of brain cells tion. Older systems used for automated processing of forms 

and the toree-dimensional lattice of connections among and documents were usually limited to reading typed or 

them. Another technique utilizes mathematical algorithms or clearly printed numbers and block letters. Neural networks 

formulas to accomplish pattern recognition. Neural net- improve accuracy and recognize diverse characters. Neural 

works have a remarkable ability to discern patterns and 20 networks have been used extensively for character recogni- 

trends too subtle or too complex for humans, much less tion in both font specific and omni-font classifiers, 

conventional computer programs, to spot. Neural networks Generally, character classification can be considered as a 

can perceive correlations among hundreds of variables, partition of the feature space into several subspaces, each of 

recognizing patterns, making associations, generalizing which corresponds to a particular character or common 

about problems yet to be experienced before, and learning 25 character feature. This is illustrated in FIG. 1, where the 

by experience. Ordinarily, computers mechanically obey feature space is two-dimensional. In this example, two 

instructions written with uncompromising precision accord- characters, say "A" and "B" (represented by small circles) 

ing to a set of rules; however, neural networks leam by are to be recognized. The two large circles represent various 

example. For example, to teach a computer associated with features of different "A"s and "B"s. The dashed line is the 

a neural network to tell the differences between good or bad partition of the space of the classifier. Overlapping of the 

symbols, patterns, characters, etc., examples must be learned larger circles implies that certain character samples are 

by the neural network. Once the neural network has seen indistinguishable in this 2-D feature space. Consequently, 

enough examples, the neural network can direct another even a perfect classifier will make errors. The characters 

device to react appropriately to subsequent examples. ^ from a single document are likely to be from one or a few 

In an article by Richard P. Lippmann entitled "Neural fonts, and have similar distortions. As a result, the characters 

Nets for Computing", IEEE International Conference on from a single document tend to cluster in feature space, as 

Acoustics, Speech, and Signal Processing, 1988, three feed- shown in FIG. 1 by small circle "A", 

forward models are described. Single and multi-layer per- While the prior art provides for character recognition, the 

cep tions which can be used for pattern classification are ^ art does not provide a means for reducing classifications 

disclosed by Lippmann, as well as Kohonen's feature map errors during character recognition. Therefore, there is a 

algorithm, as described by Kohonen et al. in "Phonotopic need for a system which reduces classification errors during 

Maps — Insightful Representation of Phonological Features character recognition using neural networks. There is also a 

for Speech Representation/' Proceedings IEEE 7th Interna- need for a system which reduces classification error during 

tional Conference on Pattern recognition, Montreal, Canada, 45 recognition of characters, patterns, symbols, or any other 

1984, which can be used for clustering or as a vector medium well known in the neural network art. 

quantizer. Therefore, the object of this invention is to provide a 

U.S. Pat. No. 5,245,697, granted to Suzuoka describes a system and method that can reduce classification errors by 

neural network processing apparatus that calculates average temporarily re-adjusting the classification boundaries during 

absolute values and utilizes the resulting average differences 50 character recognition or the recognition of other medium. It 

between the output values and a center value to set each is further object of the invention to provide a means and 

neuron to a controlled function which allows the neuron method that can take advantage of on-line unsupervised 

network to correctly identify an unknown multivalued image neural network learning techniques to adjust the classifica- 

pattem. tion boundaries during the recognition process. 

U.S. Pat. No. 5,222,194, granted to Nishimura, describes 55 It should be appreciated that neural networks have many 

another method of modifying neuron weights and reaction applications in the character, object, symbol, or pattern 

coefficients by changing input and output characteristics of recognition art. The following disclosure will refer to all 

units and weights of links in accordance with outputs of the possible medium as "characters" which should not be taken 

output layer and a particular rule. in a limiting sense with regard to the applicability of the 

U.S. Pat. No. 5,212,767, granted to Higashino et al., 60 teachings herein. All of the references cited herein are 

describes a multi-layer neural network having input, hidden incorporated by reference for their teachings. 

and output layers and a learning method wherein a processor „ T „ , w A rt „ m „ _ „ m ^ 

belonging to the biddea layer stores both the factors of SUMMARY OF THE INVENTION 

multiplication or weights of links for a successive layer To achieve the foregoing objects of the invention, and to 

nearer to the input layer and the factors of multiplication or 65 overcome the shortcomings discussed above regarding clas- 

weights of links for a preceding layer nearer to the output sification errors during character recognition, an on-line 

layer. Upon forward or backward calculation, the access to unsupervised learning algorithm is presented which 
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improves the performance of an omni-font classifier by quent recognition of characters. The inventive method uti- 
lising recognized characters for additional training. The lized by the present invention can be broken into two major 
invention applies the outputs of the hidden layer nodes of the steps: 1) initial training and 2) on-line continuous learning, 
neural net as the feature vector. Characters that are recog- First the system is generally trained for omni-fonts and 
nized with high confidence are used to dynamically train a 5 particular characters or symbols (hereinafter "characters") 
secondary classifier. This secondary classifier can be a that it must recognize. These characters typically do not 
neural net layer, simple distance classifier, or some more have flaws, variations or deviations. This training is typical 
complex classifier. After the secondary classifier is trained, in the character or pattern recognition art. Then the system 
it is combined with the original main classifier. The inven- learns subsequent characters on-line in order to provide an 
tion can re- adjust the partition located within the feature 10 adaptable tool that can better recognize variations in char- 
space of the neural network based on on-line learning by acters or characteristics of a particular font that are some- 
utilizing the secondary classifier data to form an alternative times interferred with by, for example, noise. Relying on 
partition location. The new partition can be referred to when boundaries of character positions that are in memory, the 
there is a conflict in character recognition — where the char- system compares newly acquired characters against the 
acter is classified with low confidence. 15 stored boundaries. Feature extraction from a bit map occurs 

The Advantages of the invention are the improvement of to determine which class the character falls into within the 

classification systems by using on-line unsupervised learn- feature space. The character is classified based on feature 

ing through artificial neural network technology. By using an extraction and comparison. It is at this point that error in 

artificial neural network with the adaptive technique dis- recognition may be made. The reason for the errors are 

closed herein, error rates in optical character recognition are 20 because features of the character fall outside the partitioned 

reduced. boundary of which it belongs. Referring to FIG. 2, two 

The character recognition system uses means known in characters, "A" and "B", represented by the smaller circles 

the art for capturing characters to be learned and analyzed by ' m to be recognized within the feature space which is 

an artificial neural network system. The artificial neural two-dimensional. The two larger circles represent various 

network system of the present invention includes primary 25 features of different "A"s and "B"s known by the system, 

and secondary classifications. Training characters are ini- ^ da shed line is the standard/primary partition 1 of the 

tiaily learned by the artificial neural network system through feature space for the classifier. Overlapping of the circles 

the primary classification. Subsequent characters recognized implies that certain character samples are indistinguishable 

by the character recognition system with "high confidence" m this 2-0 feature space 1. Consequently, even a perfect 

and related to the training characters are re-learned by said 30 classifier will make errors. The characters from a single 

artificial neural network system through the secondary clas- document are likely to be from one or a few fonts, and may 

sification. During operation candidate characters received by have sim^ distortions. As a result, the characters from a 

the character capturing means to be learned and recognized document tend to cluster in feature space 1, as shown 

are initially compared to the training characters within the m F 10 * 1 b y sma11 circle " A " which falls within the overlap 

primary classification and are identified (matched) if the 35 area * 

characters are recognized with high confidence. If the can- The invention applies the outputs of the hidden layer 

didate characters are not recognizable using the primary nodes of the neural net as the feature vector. Characters that 

classification, then they are compared to previously recog- are recognized with high confidence are used to dynamically 

nized characters within the secondary classification wherein train a secondary classifier. This secondary classifier can be 

the candidate characters are identified. 40 a neural net layer, simple distance classifier, or some more 

complex classifier. After the secondary classifier is trained 

SUMMARY OF THE INVENTION (typicall y 5 _ 10 S!imples pcr amaet are sufficient); it is 

The preferred embodiments and other aspects of the combined with the original main classifier. The secondary 

invention will become apparent from the following descrip- classifier results in an alternate partition 2 (shown as a solid 

tion to illustrate a preferred embodiment of the invention 45 ^nc) being formed within the feature space, 

when read in conjunction with the accompanying drawings During subsequent character recognition, a character that 

which are provided for the purpose of describing an embodi- is recognized with low confidence by the main classifier is 

ment of the invention and not for limiting same, in which: sent to the secondary classifier, defined by boundaries within 

FIG. 1 is a graphical illustration of two dimensional 5Q the secondary partition 2, for further checking. A weighted 

feature space known in the art wherein two characters are to distance classifier can be adopted as the secondary classifier, 

be recognized with the two large circles representing char- Specifically, the distance measure between the test character 

acter features of different "A J 's and "B"s, and a fixed ^d the training samples for i-th character can be calculated 
partition feature spaced is depicted by a dashed line. 

FIG. 2 is a graphical illustration of two dimensional 5S • „2 

feature space as illustrated in FIG. 1 that has been modified d^ffi-fiittU^ 

by application of the invention to provide for a alternate where g fe a test cnaracter > s : th feature) ^ md tf- are the 

partition depicted by a dashed fine. yih feature , s Mmpk mean ^ sample variance J of the 

FIG. 3 is a flow chart for the method of the invention. training data for the i-th character. Experiments have showed 

FIG. 4 is a block diagram illustrating the system compo- 60 that the combined classifier (using the secondary partition 2) 

nents for the invention. outperformed the original omni-font neural classifier in 



DETAILED DESCRIPTION OF THE 



different degrees for different test documents. 

n^cE^D^^^^n^T^Txic^^^TOc A fixed classification boundary cannot exploit the simi- 

PREFERRED EMBODIMENT OF THE , r u , ■ r . j "j* . j 

INVENTION lanty or characters in fonts and distorsrons within a docu- 

65 ment. For example, referring to FIG. 1, the error or ambi- 

Ihe invention recognizes stationary fonts continuously guity of seen in populations based on overlap within feature 

and uses already recognized characters to improve subse- space is the result of fixed partitions such as the primary 
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partition 1. As described above, the invention can readjust 
the partition based on on-line learning because an alternative 
partition location is determined by the neural network, and 
this partition can be referred to when there is ambiguity in 
character recognition. 5 

Referring to the flow chart of FIG. 3, candidate characters 
are first captured with a character capturing means known in 
the art of character recognition. The candidate characters 
must then be processed by the character recognition system 
wherein the candidate characters arc analyzed by comparing 10 
the candidate character to primary or secondary classifica- 
tion systems of the neural network to determine a match for 
the candidate characters. The test is as follows: 
If a sample character is classified with high confidence, 
then a character match is determined and provided to 15 
the character recognition system for processing; the 
sample may also be used for learning which will result 
in an alternative partition being formed for subsequent 
use. 

If the character is classified with low confidence, then the 20 
alternative partition is referred to and used resulting in 
re -classification for the character with high confidence. 
Through this test sequence, the candidate characters are 
compared by the system to training characters stored within 
the primary classification memory of the neural network to 25 
find a match if said candidate characters are recognized with 
high confidence, or are compared by the system to previ- 
ously recognized (post -training) characters that were stored 
within the neural networks secondary classification memory 
to find a match using the secondary partition if the candidate 
characters are recognized with low confidence. When a 
matching character for the candidate character is finally 
determined the character is provided through said character 
recognition system to a character generating system which 
may be, for example, memory or rendering apparatuses 35 
known in the art. 

Referring to FIG. 4, the system components required for 
the disclosed invention is generally familiar in the art of 
artificial neural networks as applied in the character recog- 
nition field. The character recognition system 11 typically 40 
comprises a neural network 12 having a primary classifica- 
tion embodied within a computer system 13 and a means for 
capturing 16 or otherwise providing a character to be 
recognized. Distinct features of the invention, however, 
further provide a secondary classification 15 as part of the 
neural network 12. Training characters are initially learned 
by said artificial neural network system and stored in 
(remembered by) said primary classification. Characters 
recognized by said character recognition system with high 
confidence and are related to the training characters are 50 
re-learned by the artificial neural network system and are 
then stored in (remembered by) the secondary classification 
15. Character analysis occurs with components known in the 
art for analyzing and recognizing candidate characters; 
however, the system disclosed herein further analyzes can- 
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didate characters received by known means for capturing 
characters 16 to be learned and recognized by enabling a 
comparison to training characters of the primary classifica- 
tion or recognized characters of the secondary classification 
15 depending on whether the candidate character is recog- 
nized with high confidence or low confidence as described 
above. Means for outputing and generating recognized char- 
acters 14 through said character recognition system are also 
known in the art and can be applied to the candidate 
character once recognized by the invention. 

While the invention is described with reference to a 
particular embodiment, this particular embodiment is 
intended to be illustrative, not limiting. Various modifica- 
tions may be made without departing from the spirit and 
scope of the invention as denned in the amended claims. 
Modifications and alterations will occur to others upon 
reading and understanding this specification; therefore, it is 
intended that all such modifications and alterations are 
included insofar as they come within the scope of the 
appended claims or equivalents thereof. 

We claim: 

1. A character recognition system, comprising: 
means for capturing characters to be analyzed; 

an artificial neural network system including at least one 
primary classifier comprising a primary classification 
memory means and at least one secondary classifier 
comprising a secondary classification memory means, 
said artificial neural network system learning a first set 
of characters and storing the first set of characters in 
said primary classifier and said artificial neural network 
system storing a second set of characters related to the 
first set of characters in said secondary classifier; and 

a character analyzer for analyzing candidate characters 
captured by said capturing means, said character ana- 
lyzer causing candidate characters to be compared with 
said first set of characters stored in said primary clas- 
sifier when said candidate characters are recognizable 
with a high degree of confidence said high degree of 
confidence being used to dynamically train said sec- 
ondary classifier, and said character analyzer causing 
candidate characters to be compared with said second 
set of characters stored in said secondary classifier 
when said candidate characters are recognizable with a 
low degree of confidence. 

2. The character recognition system of claim 1, in which 
a candidate character matches a character in one of the first 
set and the second set, further comprising a character 
rendering subsystem for rendering the candidate character 
which matches the character in the one of the first set and the 
second set. 

3. The character recognition system of claim 1, wherein 
the second set of characters in the second classification 
memory are learned twice in said artificial neural network. 

4 * * * * 
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